Learning DFA: evolution versus evidence driven state merging
نویسندگان
چکیده
Learning Deterministic Finite Automata (DFA) is a hard task that has been much studied within machine learning and evolutionary computation research. This paper presents a new method for evolving DFAs, where only the transition matrix is evolved, and the state labels are chosen to optimize the fit between final states and training set labels. This new procedure reduces the size and in particular, the complexity, of the search space. We present results on the Tomita languages, and also on a set of random DFA induction problems of varying target size and training set density. The Tomita set results show that we can learn the languages with far fewer fitness evaluations than previous evolutionary methods. On the random DFA task we compare our method with the Evidence Driven State Merging (EDSM) algorithm, which is one of the most powerful known DFA learning algorithms. We show that our method outperforms EDSM when the target DFA is small (less than 32 states) and the training set is sparse.
منابع مشابه
Results of the Abbadingo One DFA Learning Competition and a New Evidence-Driven State Merging Algorithm
This paper first describes the structure and results of the Abbadingo One DFA Learning Competition. The competition was designed to encourage work on algorithms that scale well—both to larger DFAs and to sparser training data. We then describe and discuss the winning algorithm of Rodney Price, which orders state merges according to the amount of evidence in their favor. A second winning algorit...
متن کاملMutually Compatible and Incompatible Merges for the Search of the Smallest Consistent DFA
State Merging algorithms, such as Rodney Price’s EDSM (Evidence-Driven State Merging) algorithm, have been reasonably successful at solving DFA-learning problems. EDSM, however, often does not converge to the target DFA and, in the case of sparse training data, does not converge at all. In this paper we argue that is partially due to the particular heuristic used in EDSM and also to the greedy ...
متن کاملEvidence Driven State Merging with Search
During last year's Abbadingo competition, two people showed how to improve the average-case performance of state merging algorithms for DFA learning. Rodney Price obtained a big improvement by discovering a good order for performing merges. Hugues Juill e obtained a bigger improvement at greater cost by wrapping a stochastic search around a state merging algorithm. Here we establish a new high-...
متن کاملGradual probabilistic DFA learning with caching for conversational agents
SUMMARY This paper proposes a method of reducing the cost of gradually constructing task-oriented conversational agents with FSMs by collecting example dialogues. Probabilistic-DFA learning algorithms with the state merging method are available for the FSM-based dialogue model. However, these algorithms must learn the whole model again as often as the example data increase. We proposed a learni...
متن کاملA Stochastic Search Approach to Grammar Induction
This paper describes a new sampling-based heuristic for tree search named SAGE and presents an analysis of its performance on the problem of grammar induction. This last work has been inspired by the Abbadingo DFA learning competition [14] which took place between Mars and November 1997. SAGE ended up as one of the two winners in that competition. The second winning algorithm, rst proposed by R...
متن کامل